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Background 


•  The  recent  synthesis  of  a  wide  variety  of  cyanate  ester  monomers  that  may  be 
derived  from  renewable  resources  has  created  a  newly  available  set  of 
chemical  structures  that,  due  to  a  wide  range  of  chemical  features,  provide  a 
unique  opportunity  for  the  development  of  quantitative  structure-property 
relationships  for  dicyanate  esters  and  their  polymerized  networks. 

•  Specific  structure-property  relationships  for  monomer  melting  point,  glass 
transition  temperature  at  full  conversion,  and  char  yields  at  600  °C  under 
nitrogen  and  air  have  been  developed  with  the  aid  of  partial  least  squares 
methods. 

•  The  predictions  inherent  in  these  structure-property  relationships  are  examined 
and  compared  to  predictions  based  on  ordinary  least  squares  methods. 

•  Specific  predictions  for  the  properties  of  two  as-yet  unsynthesized  dicyanate 
ester  monomers  derived  from  renewable  resources  are  also  presented. 
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Materials  and  Methods 


•  Materials  and  Data  Sources 

-The  19  monomers  /  networks  chosen  for  analysis  contain  only  the  elements  C,  H,  N, 
and  O. 

-All  feature  two  aromatic  rings  containing  one  or  more  cyanate  ester  /  cyanurate 
linkages,  connected  by  a  bridge  containing  only  non-aromatic  hydrocarbons. 

-Two  of  the  monomers  are  derived  from  petroleum;  they  are  the  well-known  Primaset 
BADCy  and  Primaset  LECy  (dicyanate  esters  of  bisphenol  A  and  E,  respectively). 

-17  have  been  derived  from  a  variety  of  plant  compounds.  The  networks  chosen  for 
comparison  have  all  been  analyzed  using  either  identical  or  nearly  identical  protocols  in 
our  laboratories,  thereby  helping  to  ensure  that  comparisons  reflect  meaningful 
differences. 

•  Methods  of  Analysis 

-  Partial  Least  Squares  and  Least  Squares  regressions  run  and  compared 

-  Consistent  scheme  of  non-duplicative  parameters  established 

-  Minimum  often  parameters  needed  to  describe  a  single  monomer 

-  Five  parameters  describing  the  substitution  pattern  around  the  ring,  valued  between  0 

and  1 

-  Five  parameters  describing  the  bridge  structure,  valued  any  number  above  0. 

-  PLS  regressions  were  carried  out  using  the  SIMPLS  algorithm  in  MATLAB,  which  centers 

the  data,  but  does  not  require  normalization 
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Example:  Latent  Variables, 
PCA,  and  PLS 

A  S2 


In  this  data  set,  the  value  of  P  is 
weakly  correlated  with  and 
S2,  and  points  far  from  the 
origin  will  disproportionately 
influence  the  correlation 


V 

P  values:  blue  =  below  average;  green  =  about  average;  orange  =  above  average 
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Example:  Latent  Variables, 
PCA,  and  PLS 


A  principal  components 


P  values:  blue  =  below  average;  green  =  about  average;  orange  =  above  average 
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Example:  Latent  Variables, 
PCA,  and  PLS 


Notice  that  the  blue  dots 
(low  P)  are  mostly  on  the  left 
(low  S1  +  S2)  while  the 
orange  dots  (high  P)  are  on 
the  right  (high  Sx+  S2) 


A  reduced  rank  regression 
analysis  would  consider  the 
direction  having  the  greatest 
impact  on  the  value  of  P, 
which  is  actually  S2  +  S2 


PLS  is  an  attempt  to  describe 
the  variation  in  P  in  terms  of 
the  smallest  number  of 
combinations  of  factors  of  S 
by  combining  PCR  and  RRR 


P  values:  blue  =  below  average;  green  =  about  average;  orange  =  above  average 
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Cross-Validation 


•  Cross-validation  is  used  to  estimate  the  error  in  predicting  the 
properties  of  an  entirely  new  structure  based  on  a  model 
developed  by  any  regression  technique  (including  PLS). 

•  To  perform  cross-validation,  the  data  set  is  divided  into  two 
parts,  a  “training  set”  (usually  90%  of  data)  and  a  “test  set” 

(10%  of  data).  Regression  is  performed  with  the  training  set  to 
develop  a  cross-validation  model,  then  the  model  is  used  to 
predict  the  values  in  the  test  set.  The  process  may  be  repeated 
with  many  different  training  and  test  sets.  The  prediction  is 
characterized  by  the  mean  square  error  of  predictions. 
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Structures  of  Monomers 
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Structural  Parameter  Illustration 


NCO 
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Explanation  of  Structural  Parameters 


Parameter 

Explanation 

OCN 

Additional  -OCN  groups  per  monomer  (e.g.  1  for  tricyanates) 

K 

"Kinked"  -OCN  groups  (that  is,  -OCN  groups  -ortho  or  -meta  to  bridge  junction);  1  if  any 
such  groups  are  present  on  a  cyanated  aromatic  ring;  0  if  not  present  on  a  cyanated 
ring;  averaged  over  all  cyanated  aromatic  rings  in  monomer  (e.g.  0  for  the  typical  4,4'  - 
OCN  substitution  pattern  in  dicyanate  monomers) 

Me-mp 

Methyl  groups  in  positions  -meta  or  -para  to  -OCN  groups;  1  if  any  such  groups  are 
present  on  a  single  aromatic  ring;  0  if  not  present;  averaged  over  all  cyanated  aromatic 
rings  in  monomer 

Me-o 

Methyl  groups  in  positions  -ortho  to  -OCN  groups;  counting  /  averaging  rules  are  the 
same  as  for  Me-mp. 

OCH3 

Methoxy  groups  on  cyanated  aromatic  rings;  counting  /  averaging  rules  are  the  same 
as  for  Me-mp 

FB 

"Flexible"  non-hydrogen  atoms  in  bridge  "backbone"  structure;  the  "backbone"  of  the 
bridge  is  defined  as  the  set  of  non-hydrogen  atoms  bonded  along  the  shortest  possible 
path  connecting  two  cyanated  aromatic  rings;  if  more  than  one  such  path  exists,  atoms 
in  all  such  paths  count,  if  more  than  one  bridge  exists,  the  total  number  of  flexible  non¬ 
hydrogen  atoms  in  all  bridge  structures  is  counted,  with  each  counted  only  once; 
"flexible"  is  defined  as  having  sp3  hybridization  and  being  bonded  to  at  least  two  other 
atoms  (e.g.  not  a  chain  terminus)  not  in  a  ring;  sp3  atoms  in  ring  structures  considered 
on  a  case  by  case  basis. 
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Explanation  of  Structural  Parameters 


Parameter 


RB 


Explanation 


"Rigid"  non-hydrogen  atoms  in  bridge  "backbone"  structure.  "Rigid"  is  defined  as  any 
non-hydrogen  atom  not  meeting  the  criteria  for  "flexible"  with  sp3  atoms  in  ring 
structures  considered  on  a  case  by  case  basis.  Non-hydrogen  atoms  that  terminate 
chains,  such  as  methyl  carbons,  and  hydroxyl  oxygens,  count  as  "rigid".  Rules  for 
backbones  and  bridges  are  the  same  as  for  FB. 

"Flexible"  non-hydrogen  atoms  in  bridge  "side  group"  structures.  "Flexible"  is  defined 
as  in  FB,  and  the  rules  for  counting  paths  and  bridges  are  the  same  as  in  FB.  A  "side 
group"  atom  is  any  non-hydrogen  atom  in  a  structure  bridging  cyanated  aromatic  rings 
that  is  not  counted  as  part  of  the  "backbone" 

"Rigid"  non-hydrogen  atoms  in  "side  group"  bridge  structures;  definitions  and  rules 
are  the  same  as  for  FB,  RB,  and  FS  (e.g.  an  isopropylidene  bridge  will  have  FB=1  and 
RS=2). 

Asymmetric  bridge.  1  if  a  bridge  group  is  symmetrically  substituted  along  its 
backbone(s),  0  if  not,  averaged  over  all  bridge  structures.  Bridge  and  backbone  are 
defined  as  in  FB. 
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Partial  least  squares  regression  characteristics  as  a 
function  of  the  number  of  regression  components  present 


100% 

90% 

80% 

"O 

70%  £ 

60% 

LU 

50%  c 

o 

40%  .5 

30%  > 

20%  * 
10% 

0% 


•  Although  the  rms  error  associated  with  the  regression  itself  decreases  as  more 
factors  are  added,  "leave  one  out"  cross-validation  shows  that  the  predictive  error 
is  minimized  with  only  three  factors 
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Predicted  melting  point  values  as  a  function  of  the  number  of 
partial  least  squares  regression  components. 
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•  Tracking  of  individual  predictions  shows  that,  after  about  4  factors  are  present,  the 
role  of  additional  factors  is  primarily  to  "fix"  individual  errors  (such  as  VL-2).  These 
"individual  fixes"  add  no  general  predictive  power,  and  likely  degrade  it 
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Melting  point  regression  coefficients  as  a  function  of  the 
number  of  partial  least  squares  regression  components. 
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•  Although  the  predicted  melting  points  do  not  change  much,  the  coefficient  values 
become  unstable  in  about  half  the  cases  after  the  3rd  or  4th  factor  is  added, 
indicating  that  the  regression  is  being  distorted  more  and  more  in  order  to 
custom-fit  the  specific  data  set.  Such  distortions  destroy  the  general  predictive 
power  of  the  regression. 
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Linear  regression  model  coefficients  for  melting  point 


Regression  Types: 
PLS  =  Partial  Least 
Squares  (3  factor) 
OLS  =  Ordianry 
Least  Squares 


•  Note  how  ordinary  least  squares  multiple  regression  fits  of  the  data  set  produce 
very  large  coefficients  for  the  facotrs,  which  are  not  physically  realistic  for  a  linear 
model  of  melting  points.  In  contrast,  the  coefficient  values  derived  from  partial 
least  squares  are  reasonable. 
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Predicted  melting  points  of  monomers  not  included  in 

regression  models 
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•  Primaset®  LECy  ("LE")  melts  at  30  °C.  "AN-1"  is  a  liquid  at  room  temperature  (though  it 
may  be  supercooled,  so  it  could  melt  at  up  to  70  °C.  "DC-3"  is  also  a  liquid  at  room 
temperature.  "EUG-U"  and  "AN-U"  have  not  been  synthesized,  but  the  partial  least 
squares  model  predicts  "AN-U"  will  be  a  high-melting  solid,  whereas  ordinary  least  squares 
predicts  it  will  be  liquid  at  room  temperature. 
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Chemical  structures  of  the  as-yet  unsynthesized  monomer 

AN-U  and  EUG-U. 


AN-U 
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Summary 


•  Using  a  set  of  19  cyanate  ester  monomers,  17  of  which  may  be  derived  from 
renewable  sources,  structure  property  relationships,  in  the  form  of  partial  linear 
regression  models,  have  been  produced  for  the  monomer  melting  point  and 
compared  to  more  traditional  models  based  on  ordinary  least  squares 

•  The  partial  least  squares  method  was  able  to  provide  linear  models  based  on  a 
set  of  10  structural  parameters  that  predicted  melting  points  to  within  about  35  °C, 

•  The  predictive  power  of  the  patial  least  squares  models  was  compared  to  that  of 
ordinary  least  squares  and  shown  to  be  at  least  as  good. 

•  Analysis  of  ordinary  least  squares  models  for  the  monomer  melting  point  clearly 
showed  that  such  models  suffer  from  over-prediction. 

•  The  models  have  been  used  to  predict  the  properties  of  two  cyanate  ester 
systems  that  have  not  yet  been  synthesized,  but  could  easily  be  produced  for 
future  validation  experiments. 


DISTRIBUTION  A:  Approved  for  public  release.  Distribution  is  unlimited. 


21 


DISTRIBUTION  A:  Approved  for  public  release.  Distribution  is  unlimited. 


22 


